Muhammad Omair SHAFIQ Hafiz Farooq AHMAD Hiroki SUGURI Arshad ALI
Grid computing is an open, heterogeneous and highly dynamic environment based on the principles of service oriented computing. It focuses on basic infrastructure for coordinated resource sharing among virtual organizations to achieve high performance and availability. However, use of existing Grid computing environment is quite complex and requires a lot of human intervention. In order to avoid this intervention, enhancements are required in bringing autonomy and semantics in existing Grid infrastructure. Semantics would act as glue for autonomy in the process of efficient resource discovery and utilization. Several ontologies and ontology languages have been proposed in this regard which not only have some shortcoming but also poses a sort of overhead for the Grid environment. On the other hand, agents are autonomous problem solving entities, and can negotiate semantically for interoperation with each other in dynamic environments. Inspired from the concept of Autonomous Decentralized Systems, we propose that the above mentioned goals can be achieved by integrating FIPA Multi Agent Systems with the Grid Service Architecture and hence to lay the foundation for Autonomous Semantic Grid. Autonomous Semantic Grid system architecture is aimed to provide an improved infrastructure by bringing autonomy, semantic interoperability and decentralization in the Grid computing for emerging applications. This paper then presents implementation details of first milestone toward Autonomous Semantic Grid realization based on a middleware, namely AgentWeb Gateway for integration of Multi Agent Systems and Grid Service Architecture. Evaluation of the system has also been performed over a number of application scenarios.
Chao-Cheng WEN Yuan-Sun CHU Kim-Joan CHEN
Grid computing is a state-of-the-art parallel computing technology which enables worldwide computers to dynamically share their computing powers and resource to each other. The grid takes advantage of Internet as a universal communication platform to carry messages. Basically, Internet doesn't guarantee loss-free and ordered transmission, hence, the grid should keep the cause and effect of events by itself to ensure the correct ordering of command invocations at the remote hosts. The ordering issue arises when the messages travel across the networks with unpredictable delay. Recent research has studied the security and resource control issues, but failed to address the requirements of transport layer on the grid communication platform. In this paper, we propose the Causal Ordered Grid (COG) architecture and implement it to study the transport performance issues when the grid is built over worldwide networks. The COG provides a novel service model to the applications with time-sensitive and causal-ordered transportation. From our experiments, the design of the grid middleware should use a causal-ordered, time-sensitive transportation rather than TCP. Our research will be beneficial to the improvement of the grid computing and can provide wealthy empirical results for the designer.
Jacobo TARRIO Juan TOURIÑO María J. MARTIN Patricia GONZALEZ Ramon DOALLO
Grid computing can help to promote high-performance computing at a low overall cost by encouraging research centers to share their resources. However, research staff usually finds it quite hard to use Grids effectively, due to the need of installing and managing new Grid software. Thus, Grid portals are created, making it easier to take advantage of the full capability of the Grid, favoring in this way its use. The goal of this paper is to describe the process of design and implementation of a Grid Portal with the aim of both supporting distributed high-performance resources and make its use by researchers as transparent as possible. This portal uses standard Grid and Web technologies. We have designed the portal so that it can be adapted to different existing Grid infrastructures, based on the Globus Toolkit, and new functionalities can be easily added. The first prototype of the portal has been tested on an experimental Grid platform, and we present encouraging experiences carried out there.
Rodrigo Fernandes de MELLO Erico C. T. de MATTOS Luis Carlos TREVELIN Maria Stela Veludo de PAIVA Laurence T. YANG
The availability of a low cost hardware has increased the development of distributed systems, by making then more and more accessible. In order to optimize the resources allocation on the distributed systems, some load balancing algorithms have been proposed. These algorithms distribute the application loads over the environment computers, make homogeneous the occupation of the whole environment and increase the application performance. This equal distribution prevents certain computers to get overloaded, to the detriment of the idleness of the other ones. This article proposes and analyzes the TLBAGrid, a load balancing algorithm for Grid computing environments.
Namyoon WOO Hyungsoo JUNG Heon Young YEOM Taesoon PARK Hyungwoo PARK
Fault-tolerance is an essential feature of the distributed systems where the possibility of a failure increases with the growth of the system. In spite of extensive researches over two decades, fault-tolerance systems have not succeeded in practical use. It is due to the high overhead and the unhandiness of the previous fault-tolerance systems. In this paper, we propose MPICH-GF, a user-transparent checkpointing system for grid-enabled MPICH. Our objectives are to fill the gap between the theory and the practice of fault-tolerance systems, and to provide a checkpointing-recovery system for grids. To build a fault-tolerant MPICH version, we have designed task migration, dynamic process management, and atomic message transfer. MPICH-GF requires no modification of application source codes, and it affects the MPICH communication characteristics as less as possible. The features of MPICH-GF are that it supports the direct message transfer mode and that all of the implementation has been done at the lower layer, that is, the abstract device level. We have evaluated MPICH-GF using NPB applications on Globus middleware.
Lee-Sub LEE Soo-Hyun PARK Doo-Kwon BAIK
Providing workflow function is one of the most important research issues in the next generation Internet services such as Web Service and Grid Computing. Scalability for Internet scale services, reliability for unstable Internet resources, and management functions of workflow systems are the essential requirements in these environments. However, existing workflow enactment models for enterprises could not meet these requirements. This paper proposes the PeerFlow that is a P2P based workflow enactment model, to provide workflow functions for the next generation Internet services. To apply P2P model to the workflow enactment model, we introduce the concept of the instance buddy and the index data of workflow instances, then propose the principle architecture of the PeerFlow. The instance buddy enables the autonomous processing of peers, and it is used for recovery and monitoring functions. This paper also presents the recovery capabilities of PeerFlow with formal proofs for the reliability issues and a performance evaluation with SimPy, the Python simulation package.
Yiyuan GONG Morikazu NAKAMURA Takashi MATSUMURA Kenji ONAGA
In this paper we propose a parallel and distributed computation of genetic local search with irregular topology in distributed environments. The scheme we propose in this paper is implemented with a tree topology established on an irregular network where each computing element carries out genetic local search on its own chromosome set and communicates with its parent when the best solution of each generation is updated. We evaluate the proposed algorithm by a simulation system implemented on a PC-cluster. We test our algorithm on four types topologies: star, line, balanced binary tree and sided binary tree, and investigate the influence of communication topology and delay on the evolution process.